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Specification 



1. Title of the- Invention: CONCATENATION METHOD OF PROLONGED 
SOUND 

2. What is claimed is: 

A concatenation method of a prolonged sound where a subsequent 
syllable is a prolonged sound, comprising: 

in a case where amp parameters are used, concatenating 
parameters of a preceding syllable until a terminating frame of 
a prolonged sound section syllable at a constant value; and 

in a case where linear predictive coefficient parameters are 
used, using for a prolonged sound section parameters within a 
syllable file for an independently uttered vowel same as a vowel 
of a preceding syllable, and concatenating by linear 
interpolation these parameters to parameters of a terminating 
frame of a static segment within a syllable file for the vowel 
of the preceding syllable. 

3. Detailed Description of the Invention 
[Industrial Field of the Invention] 

This invention relates to a concatenation method of a prolonged 
sound, in speech synthesis methods using rule-based synthesis. 
[Prior Art] 

A conventional concatenation method of a prolonged sound is 
described with reference to Fig. 4. Fig. 4 shows a conventional 
concatenation method of a prolonged sound using ; amp,.p^arameters . 
In the figure, reference numeral, \31;/den6tes the duration .of a 
preceding syllable, 32 the duration'of a prolonged sound, 33 shows 
the duration of a subsequent syllable, 34 a vowel segment of the 
preceding syllable, 35 a parameter for the starting frame of a 
static segment in the syllable file for the preceding syllable, 
and 36 a parameter for the terminating frame of the static segment 
in the syllable file for the preceding syllable. Reference 
numeral 37 denotes a frame into which an independently uttered 
vowel same as the vowel in the preceding syllable is embedded, 
38 (shaded area) parameters from the head of the syllable file 
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of the independently uttered vowel to the terminating frame 39 
of the static segment, and 39 and 40 the starting and terminating 
frames of the static segment in the syllable file of the 
independently uttered vowel. That is to say, the conventional 
concatenation method of a prolonged sound uses the parameter from 
the syllable file of the independently uttered vowel same as the 
vowel in the preceding syllable for the section corresponding to 
the prolonged sound, and links these parameters with parameters 
in the syllable file for the preceding syllable, as is the case 
with concatenation methods of other syllables. 

In this way, in the conventional concatenation method of a 
prolonged sound, when an . independently uttered vowel of -a 
preceding syllable is substituted into a section corresponding 
to a prolonged sound symbol, it is possible to perform processing 
using an algorithm similar. to those for other ordinary syllables. 
Furthermore, while Fig. 4 shows a case of amp parameters , linear 
predictive coefficient parameters may also be used in a similar 
way. 

[Problem to be Solved by the Invention] 

However, in the conventional method above, an independently 
uttered vowel same as the vowel in the preceding syllable is used 
in the section corresponding to a prolonged sound, resulting in 
the problem of the section not perceptually sounding like a 
prolonged sound . 

This invention provides a concatenation method of a prolonged 
sound that can synthesize" prolonged ;. s builds fusing rule-based 
synthesis . 

[Means for Solving the Problem] 
In order to solve the above problem, this invention provides a 
concatenation method of a prolonged sound where a subsequent 
syllable is a prolonged sound, comprising: in a case where amp 
parameters are used, concatenating parameters of a preceding 
syllable until a terminating frame of a prolonged sound section 
syllable at a constant value; and, in a case where linear 
predictive coefficient parameters are used, using for a prolonged 
sound section parameters within a syllable file for an 
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independently uttered vowel same as a vowel of a preceding 
syllable, and concatenating by linear interpolation, these 
parameters with parameters of a terminating frame of a static 
segment within a syllable file for the vowel of the preceding 
syllable 
[Function] 

The above method of this invention allows creating a parameter 
time series required for synthesizing synthesized . sounds, 
containing prolonged sounds, and obtaining a prolonged sound that 
sounds perceptually natural, through the use of parameters in 
existing syllable files without adding new prolonged sound files 
to the syllable files. 

[Embodiment] 

An embodiment of this invention is described with reference to 
Fig. 1 and Fig. 2. Fig. 1 .shows a' concatenation method using amp 
parameters in an embodiment of the invention, while Fig. 2 shows 
a concatenation method using linear predictive coefficient 
parameters in an embodiment of the invention. In Fig. 1, 
reference numeral 1 denotes the duration of a preceding syllable, 
2 the duration of a prolonged sound, 3 the duration of a subsequent 
syllable, 4 a vowel segment of the preceding syllable, 5 a 
parameter for the starting frame of a static segment in the 
syllable file for the preceding syllable, 6 a parameter for the 
terminating frame of the static, segment in the syllable file for 
the preceding syllable, 7 a section corresponding to the prolonged 
sound, 8 (shown by a point' on the time axis) a^parameter for the 
starting frame of a static segment in the syllable file for the 
prolonged sound section, and 9 a parameter for the terminating 
frame of the static segment in the syllable file for the prolonged 
sound section. As shown in Fig.. 1, in the method using amp 
parameters, the parameter for the terminating frame of the 
preceding syllable 6 is used as a constant value until the 
terminating frame of the static segment in the syllable file for 
the prolonged sound section 3. 

In Fig. 2, which shows a concatenation method using linear 
predictive coefficient parameters, reference numeral II denotes 



the duration of a preceding syllable, 12 the duration of a 
prolonged sound, 13 the duration of a subsequent syllable, 14 a 
vowel segment of the preceding syllable, 15 a parameter for the 
starting frame of a static segment in the syllable file for the 
preceding syllable, 16 a parameter for the terminating frame of 
the static segment in the syllable file for the preceding syllable, 
17 a prolonged sound section, 18 a parameter for the starting frame 
of a static segment in the syllable file for an independently 
uttered vowel same as the vowel in the preceding syllable, and 
19 a parameter for the terminating frame of the static segment 
in the syllable file for the independently uttered vowel. 

As shown in Fig. 2, in .the method using liner predictive 
coefficient parameters, four ..parameters , namely the parameter for 
the starting frame of the static segment in the syllable file for 
the preceding syllable 15, the parameter for the terminating frame 
of the static segment in the same syllable file 16, the parameter 
for the starting frame in the static segment in the syllable file 
for the independently uttered vowel same as the vowel of the 
preceding syllable 18,. and the parameter for the terminating frame 
of the static segment in the same syllable file 19, are used for 
linear interpolation concatenating 15 to 16, 16 to 18, and 18 to 
19. 

In this way, according to the embodiment .of this -invention, the 
concatenation method using amp parameters has the advantage or 
there being little sense of discontinuity because the parameters 
for the preceding syllable are used unchanged until the 
terminating frame of the syllable for the prolonged sound section. 
In addition, the concatenation method using linear predictive 
coefficient parameters has the effect of being able to preserve 
phonological characteristics because the parameters from the 
syllable file for the same independently uttered vowel as the 
vowel in the preceding syllable are used in the prolonged sound 
section. 

Next, another embodiment of this invention is described with 
reference to Fig. 3. Fig. 3 shows a concatenation method using 
amp parameters in another embodiment of this invention where a 



prolonged sound is at the end of a word. In the figure, reference 
numeral 21 denotes the duration of a preceding syllable, 22 the 
duration of a prolonged sound when it is in the middle of a word 
(the same as the duration of a prolonged sound shown by 2, 12, 
and 32 in Figs. 1, 2, and 4) , 23 a vowel segment of the preceding 
syllable, 24 a parameter for the starting frame of a static segment 
in the syllable file for the preceding syllable, 25 a parameter 
for the terminating frame of the static segment in the syllable 
file for the preceding syllable, 26 a prolonged sound section, 
27 a parameter for the starting frame of a static segment in the 
syllable file for the prolonged sound section, and 28 a parameter 
for the terminating frame of .the static segment in the syllable 
file for the prolonged sound section. In contrast to the case 
of Fig. 1, where the parameter would be kept at a constant value 
until the terminating frame of the' static segment of the prolonged 
sound section 28, the amp parameter is kept at a constant value 
until the starting frame of the static segment of the prolonged 
sound section 27, and then subjected to linear interpolation so 
as to be set at 0 at the terminating frame of the static segment 
28, because a prolonged sound is vocalized shorter at the end of 
a word than in the middle of the word. Note that, if linear 
predictive coefficient parameters are used in the same case, the 
concatenation method used is the same as when the prolonged sound 
is in the middle of a word (the concatenation. method shown; in Fig. 
2) . 

By using the above word-end processing, it is possible to obtain 
perceptually natural prolonged sounds. 

[Effect of the Invention] 

According to this invention, it is possible to synthesize 
prolonged sounds that are perceptually extremely natural by a 
concatenation method using amp parameters and linear predictive- 
coefficient parameters within ordinary syllable files used in CV 
(consonant-vowel) syllable concatenation, in rule-based 
synthesis methods, without creating special files for prolonged 
sounds. 
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4. Brief Description of Drawings 

Fig. 1 shows concatenation method using amp parameters in an 
embodiment of this invention. 

Fig. 2 shows a concatenation method using linear predictive 
coefficient parameters in an embodiment of this invention. 

Fig. 3 shows a concatenation method using amp parameters in 
another embodiment of this invention where a prolonged sound is 
at the end of a word. 

Fig. 4 shows a conventional concatenation method using amp 
parameters . 

1, 11, 21: Duration of preceding syllable 

2, 12, 22: Duration of prolonged sound 

3, 13: Duration of subsequent syllable 

4, 14, 23: Vowel segment ..of preceding syllable 

5, 24: Amp parameter for starting frame of static segment in 
syllable file for preceding syllable 

6, 25: Amp parameter for terminating frame of static segment in 
syllable file for preceding syllable 

7, 17, 26: Prolonged sound section 

8, 27: Amp parameter for starting frame of static segment in 
syllable file for prolonged sound section 

9, 28: Amp parameter for terminating frame of static segment in 
syllable file for prolonged soui>d v: section . 

15: Linear predictive coefficient parameter ' for starting frame 
of static segment in syllable file for preceding syllable 
16: Linear predictive coefficient parameter for terminating frame 
of static segment in syllable file' for preceding syllable 
18: Linear predictive coefficient parameter for starting frame 
of static segment in syllable file for independently uttered vowel 
19 : Linear predictive coefficient parameter for terminating frame 
of static segment in syllable file for independently uttered vowel 
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1 : Duration of preceding syllable 

2 : Duration of prolonged sound 

3: Duration of subsequent syllable 

4 : Vowel segment of preceding syllable 

5: Amp parameter for starting frame of 

static segment' in syllable file for 

preceding syllable 
6: Amp parameter for terminating frame of static 

segment in syllable file for preceding syllable . 
7 : Prolonged sound section 

8 : Amp parameter for starting frame of static 
segment in syllable file for prolonged sound 
section 

9: Amp parameter for terminating frame of static 
segment in syllable file for prolonged sound 
section 



FIG. 1 
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11 : Duration of preceding syllable 
12 : Duration of prolonged sound 
13: Duration of subsequent syllable 
14: Vowel segment of preceding syllable 
15: Linear predictive coefficient parameter 
for starting frame of static segment in 
syllable file for preceding syllable 
16: Linear predictive coefficient parameter 
for terminating frame 'bf static segment 
in syllable file for preceding syllable 
1 7 : Prolonged sound section x * 

18: Linear predictive coefficient parameter* 
( for starting frame of static segment in 
syllable file for independently uttered vowel 
19: Linear predictive coefficient parameter 

for terminating frame of static segment in 
syllable file for independently uttered vowel 



FIG. 2 
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21: Duration of preceding syllable 

22 : Duration of prolonged sound 

23: Vowel segment of preceding syllable 

24: Amp parameter for starting frame of 

static segment in syllable file for preceding 

syllable 

25: Amp parameter for terminating frame of static 

segment in syllable file for preceding syllable 
26: Prolonged sound section 

27 : Amp parameter for starting frame of static 

segment in syllable file for prolonged sound 
section 

28: Amp parameter for terminating frame of static 
segment in syllable file for prolonged sound 
section 



FIG. 3 
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31 : Duration of preceding syllable 

32 : Duration of prolonged sound 

33 : Duration of subsequent syllable 

34 : Vowel segment of preceding syllable 

35: Amp parameter for starting frame of static 

segment in syllable file for preceding syllable 
36 : Amp parameter for terminating frame of static 

segment in syllable file for preceding syllable 
37 : Frame in which independently uttered vowel is embedded 
38 : Amp parameter from head of syllable file for 

independently uttered vowel to starting frame of 

static segment 

39: Amp parameter for starting frame of static segment in 
syllable file for independently uttered vowel 

40: Amp parameter for terminating frame of static 

segment in syllable file for independently uttered vowel 



FIG. 4 
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